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Abstract 

Complexes of dysprosium, holmium, and erbium find many applications as single-molecule magnets, as contrast agents for 
magnetic resonance imaging, as anti-cancer agents, in optical telecommunications, etc. Therefore, the development of tools 
that can be proven helpful to complex design is presently an active area of research. In this article, we advance a major 
improvement to the semiempirical description of lanthanide complexes: the Recife Model 1, RMI, model for the 
lanthanides, parameterized for the trications of Dy, Ho, and Er. By representing such lanthanide in the RMI calculation as a 
three-electron atom with a set of 5 d, 6 s, and 6 p semiempirical orbitals, the accuracy of the previous sparkle models, 
mainly concentrated on lanthanide-oxygen and lanthanide-nitrogen distances, is extended to other types of bonds in the 
trication complexes' coordination polyhedra, such as lanthanide-carbon, lanthanide-chlorine, etc. This is even more 
important as, for example, lanthanide-carbon atom distances in the coordination polyhedra of the complexes comprise 
about 30% of all distances for all complexes of Dy, Ho, and Er considered. Our results indicate that the average unsigned 
mean error for the lanthanide-carbon distances dropped from an average of 0.30 A, for the sparkle models, to 0.04 A for the 
RMI model for the lanthanides; for a total of 509 such distances for the set of all Dy, Ho, and Er complexes considered. A 
similar behavior took place for the other distances as well, such as lanthanide-chlorine, lanthanide-bromine, lanthanide, 
phosphorus and lanthanide-sulfur. Thus, the RMI model for the lanthanides, being advanced in this article, broadens the 
range of application of semiempirical models to lanthanide complexes by including comprehensively many other types of 
bonds not adequately described by the previous models. 
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Introduction 

Lanthanide complexes, as is well known, have a wide range of 
high technology applications. Of particular importance is the 
discovery that, due to their slow magnetization relaxation, 
lanthanide mononuclear complexes may function as single- 
molecule magnets [1,2], the ultimate size limit for spin-based 
devices. Dysprosium complexes, in particular, will be very 
important in the development of magnetic materials because of 
recent results leading to the highest relaxation energy barriers for 
multinuclear clusters [3,4], the highest temperature at which 
hysteresis has been observed for any single complex [5], and a 
record magnetic blocking temperature of 8.3 K at a sweep rate of 
0.08 Ts-1 [6]. Future research, for example, might be directed 
towards the design of dysprosium complexes that may operate as 
single-molecule magnets capable of preserving their magnetization 
at higher and more practical temperatures [6]. Dysprosium 
complexes are therefore promising for optical storage and 
memory. 

Not only that, both dysprosium and holmium complexes can 
also effectively function in magnetic resonance imaging, MRI, as 
negative contrast agents at high magnetic fields, producing darker 



images, and as agents for susceptibility-induced enhancement at 
low magnetic fields [7]. Indeed, they are complementary to 
gadolinium complexes, which act as positive contrast agents, 
which brighten the image. Indeed, through the simultaneous 
apphcations of gadolinium and dysprosium based contrast agents 
to the MRI diagnosis of conditions such as ischemic heart disease, 
unprecedented details can now be revealed [8,9]. Future efforts 
will likely be intensified towards the design of such MRI contrast 
agents for the imaging of cellular molecular events involved in 
normal and pathological processes, including site specific macro- 
molecular and particulate delivery systems [7]. 

Holmium is also employed in cancer therapeutics due to the 
characteristics of its "Ho isotope and of its complexes, like 
I66H0-DOTMP which has been used in combination with 
chemotherapy in the treatment of myeloma because it concen- 
trates in metastases of the skeleton and irradiates bone marrow 
[10]. ^ 

Erbium (III) luminesces at 1.55 |J.m, essentially at the center of 
the third telecommunication window at 1.540 nm. Hence, erbium 
has been used in long-distance optical transmissions, power 
amplifiers, repeaters, etc. However, inorganic materials doped 



PLOS ONE I www.plosone.org 



1 



January 2014 | Volume 9 | Issue 1 | e86376 



RMI Parameters for Dysprosium, Holmium and Erbium 



with erbium, display a very narrow full width at half maximum 
[1 1]. In order to increase the band width, erbium complexes have 
been used in order to both protect the erbium ion from vibrational 
coupling, at the same time enhancing the absorption of light 
through the so-called antenna effect. Indeed, erbium complexes 
have been prepared that exhibit a much broader fuU width at half 
maximum of 68 nm [12], a significant broadening when 
compared to erbium implanted silica which has a typical value 
of 1 1 nm for its most intense peak. 

Thus, the design of lanthanide complexes towards enhancement 
of the property of interest, while seeking to avoid eventual side 
effects to the health of the subject (where applicable) is an active 
area of research, which may largely benefit from quantum 
chemical tools that attempt to predict several of the physical, 
chemical and even pharmacological [13] properties of the 
conjectural new structures being considered; structures which 
might be sketched by assembling around the lanthanide ion, 
ligands, selected from a library of ligands in a combinatorial 
manner. And the most important information, from which 



Table 1. Parameters* for the RMI model for the trications of 
Dy, Ho and Er. 



RMI 



Parameter Unit 








u,. 


eV 


-20.92623973 


-22.05745867 


-21.97839904 




eV 


-7.66730575 


-7.59563761 


-7.60784986 


Um 


eV 


-17.94081525 


- 1 8.00040589 


-17.97684107 


L 


bohr"' 


1 .29527540 


1.33055043 


1.34775672 


Cp 


bohr"' 


1.91210659 


1.77955939 


1 .80648084 


Cd 


bohr"' 


1.41339670 


1.53652417 


1.46618905 




eV 


-7.60670536 


-5.64522644 


-5.63471034 




eV 


1.96173362 


0.00653676 


-0.01897203 




eV 


-4.36852734 


-4.31289917 


-4.25067889 




eV 


8.30543139 


8.24056943 


8.25732681 




eV 


1.31036509 


1.24543189 


1.24874510 


Pcore 


bohr 


1.62505501 


1.71955962 


2.71713627 


a 




1.34825876 


1.33007543 


1.32010273 


Cs' 


bohr"' 


1.37236617 


1 .49803844 


1.44675714 




bohr"' 


1 .07407253 


1.96749739 


1.97388315 


u 


bohr"' 


0.81914360 


0.66302146 


0.65046083 




none 


1.13071544 


1.09070756 


1.17417665 


b, 


A-^ 


7.71195583 


7.57151625 


7.58325164 




A 


1.53665819 


1 .4909541 1 


1.50354881 




none 


0.06845575 


0.00141941 


0.00864571 


b2 


A-^ 


7.50653990 


7.79969636 


7.81378785 


C2 


A 


3.23417102 


3.25425084 


3.23359665 



All these parameters are as defined in the formalisms and equations of the RMI 
model. 

*Parameters are s, p, and d atomic orbital one-electron one-center integrals Uss, 
Upp and Udd; the s, p, and d Slater atomic orbital exponents i;^, ^p, and c^; the s, 
p, and d atomic orbital one-electron two-center resonance integral terms fi^, fip, 
and 1^^; the core-core repulsion term a; the two-electron integrals F°sd- G^sd-' and 
the additive term Pcore needed to evaluate core-electron and core-core nuclear 
interactions; the second set of exponents to compute the one-center integrals 
Cs', Cp', and Cd'; and the six parameters for the two Gaussian functions: height, aj; 
inverse broadness, bi; and displacement, C|. 
doi:l 0.1 371/journal.pone.0086376.t001 



essentially all quantum chemical property predictions derive, is 
an accurate geometry of the molecular structure of the complex. 

Predictions of geometries of lanthanide complexes from ab initio 
calculations are not so easy due not only to significant relativistic 
effects, a consequence of their high atomic numbers, but also to 
the complex manifold of microstates due, not only to a partially 
filled f-shell, but also from possibly partially filled 5 d 6 s and 6 p 
shells [14]. Therefore, full geometry optimizations from such first 
principles calculations are essentially unfeasible for the technolog- 
ically useful complexes, which usually exhibit sizes of the order of 
100 atoms or more. As a consequence, effective core potentials 
arise as a practical and very efficient manner of circumventing the 
complexity, while retaining important characteristics of ab initio 
calculations. Of these, the most widely used are the relativistic 
pseudopotentials of Dolg [15,16] which represent an excellent 
compromise between accuracy and usage of computational 
resources, mainly computing time. So far, the most thorough 
study of the geometry prediction accuracy of these relativistic 
potentials has been carried out by our research group in 2006 
when fuU geometry optimizations were carried out on 52 different 
lanthanide complexes, including complexes of dysprosium (III), 
holmium (III) and erbium (III). [17] The counterintuitive results 
obtained indicated that the best combination of method with basis 
set when using the MWB pseudopotential was RHF/STO-3G 
when the intent of the calculation was to predict the geometry of 
the coordination polyhedron - very important for any subsequent 
ligand field application. Moreover, either increasing the basis set, 
or adding electron correlation, only worsened the quality of the 
resulting coordination polyhedron. On the other hand, although 
the quality of the obtained coordination polyhedron via RH/ 
STO-3G was very good, that could not be said of the geometry of 
the attached organic ligands. 

In 1 994, we introduced the Sparkle Model for the calculation of 
lanthanide complexes [18,19], a semiempirical model within the 
framework of the AMI semiempirical model [20], which replaced 
the lanthanide ion by a -l-3e charge, with the corresponding 
Coulomb field superimposed to a repulsive potential of the form 
exp(-o(r), with a being a parameter designed to somewhat delineate 
the size of the lanthanide ion, preventing the implosion of the 
ligands towards it. A very useful method of obtaining absorption 
spectra of lanthanide complexes was subsequently published [21]. 
Later [22], Gaussians were added to the core-core repulsion of the 
sparkle-ligand atom to make the Sparkle Model more consistent 
with the AMI. In 2005, based on a parameterization scheme 
employed for europium, gadolinium and terbium [23], the first 
useful and accurate semiempirical model for dysprosium was 
defined [24], followed by holmium [25]; and in 2006 for erbium 
[26]. These models were defined for AMI, and became later 
available in MOPAC2007 [27], the overall model being called 
Sparkle /AMI. So far, most applications of the Sparkle Model are 
related to luminescence research [28-30]. But since different 
semiempirical models possess different accuracies and eventually 
develop particular niches of applications, it soon became a 
necessity to extend the Sparkle Model to others, giving rise to 
Sparkle/PM3 [31,32], Sparkle/PM6 [33], Sparkle/PM7 [34], 
targeted to solids, and Sparkle/RMl [35]. 

However, none of the above mentioned Sparkle Models 
attaches semiempirical atomic orbitals to the lanthanide ion. 
Nevertheless, these models are all very accurate to describe 
lanthanide-ligand atom distances when the coordinating atom of 
the ligands is another lanthanide, oxygen or nitrogen. By moving 
towards other types of lanthanide-ligand atom bonds, however, the 
accuracy of the Sparkle Models starts to wane. All that points out 
to the fact that there is some degree of overlap between the orbitals 
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Table 2. Unsigned mean errors, UIVlE(Dy.L)S and UIVlEs, for the 
RMI model for the lanthanides, as compared to the respective 
experimental crystallographic values, obtained from the 
Cambridge Structural Database, [37-39] for each of the 61 
dysprosium (III) complexes. 





Complex* 


RMI 




Complex* 


RMI 






UME|Dy-L)S 


■ IMP IA\ 




UME,Dy.L,s (A) 


UME 


AMAQDY 


0.0517 


0.1503 


AKUKAT 


0.0216 


0.1148 


BAFZUE 


0.0470 


0.1 129 


BEXLIA 


0.0300 


0.0972 


BIHLIN 


0.0317 


0.0719 


DAMPEN 


0.0410 


0.0953 


BUVXIZOl 


0.0658 


0.0872 


DEKBEB 


0.0438 


0.0832 


CECLIF 


0.0522 


0.0959 


DEKBEBOl 


0.0370 


0.0843 


CECLIF10 


0.0521 


0.0960 


DEKCAY01 


0.0632 


0.1068 


DIBTIR 


0.0313 


0.0977 


DUFCOWIO 0.0348 


0.1042 


DIDBOH 


0.0417 


0.1567 


FIGXEZ 


0.0327 


0.1403 


FOPNAZ 


0.0138 


0.1471 


FUXPAPOl 


0.1191 


0.1026 


FUXRAR 


0.0401 


0.1662 


HIVWEP 


0.0665 


0.1222 


GAKYEW 


0.0258 


0.1273 


MUL YUU 


0.0942 


0.0976 


GINPUO 


0.0455 


0.0813 


IMOXAJ 


0.0271 


0.1036 


HANCAA 


0.0553 


0.1424 


KILZOU02 


0.0487 


0.1307 


KITGEZ 


0.0480 


0.1391 


KUYBIP 


0.1338 


0.1260 


LEYHUS 


0.0296 


0.2200 


LEZZOG 


0.0343 


0.0741 


MANHOY 


0.0804 


0.1208 


MECCUT 


0.0408 


0.1836 


PALBIN 


0.0205 


0.4919 


NAKMAO 


0.0323 


0.0713 


QQQEMMOl 


0.0328 


0.0841 


NAPHAN 


0.0377 


0.0883 


SETADY 


0.0883 


0.2516 


OHUYUM 


0.0286 


0.0985 


TISQUH 


0.0670 


0.1240 


RABBEX 


0.0825 


0.2034 


TUQTUU 


0.0601 


0.0940 


ROCTIN 


0.1118 


0.2276 


TUQTUUOl 


0.0601 


0.0943 


TESHEF 


0.0348 


0.1448 


VOSBOU 


0.0587 


0.0879 


TESHOP 


0.0571 


0.1010 


XAWVIA 


0.0408 


0.1087 


TESJEH 


0.0356 


0.1150 


XEQMAH 


0.0987 


0.1019 


TESJIL 


0.0564 


0.1008 


XIVFUD 


0.1015 


0.2004 


USEPEO 


0.0484 


0.1090 


YAVSOD 


0.0432 


0.0726 


WAQZEU 


0.1499 


0.1742 


ZAXSAS 


0.0700 


0.1357 


WAWJOTOl 


0.1152 


0.1538 


ZZZARGOl 


0.0602 


0.0897 


WEDHUJ 


0.0282 


0.1089 


AGUTOL 


0.0719 


0.1019 


XAYRIZ 


0.0416 


0.0874 


AHANED 


0.0817 


0.1433 









*The complexes are identified by their unique CSD codes [37-39]. 
doi:1 0.1 371 /journal.pone.0086376.t002 



of the lanthanide and those of the coordinating atoms - in short, 
there is a degree of covalence not taken into account by the 
Sparkle Model. 

In this article, in order to considerably broaden the range of 
applications of semiempirical methods for lanthanide complexes, 
we introduce a new model with orbitals for the lanthanide 
trications of dysprosium, holmium, and erbium, within RMI [36], 
which we call simply RMI model for the lanthanides, a 
significandy more general model, not to be confused with 
Sparkle/RMl [35] which does not have orbitals associated with 
the lanthanide ion. 



Table 3. Unsigned mean errors, UME(ho-l)S and UMEs, for the 
RMI model for the lanthanides, as compared to the respective 
experimental crystallographic values, obtained from the 
Cambridge Structural Database, [37-39] for each of the 40 
holmium (III) complexes. 



Complex^ 


Method RMI 




Complex^ 


M(>thnd RMI 






UME(Ho-L)S 

(A) 


UME (A) 




UME,H„.L|S (A) 


UME 

(A) 


dAodAIn 


0.051 3 


0.1 131 


aAKvUd 


0.0363 


0.091 2 


BEYSAZ 


0.0529 


0.1 976 


aAVvVUu 


0.0530 


0.0930 


Di i\/v/~icni 
dUV AUrU 1 


0.0634 


0.0869 


VCf~\tiA CI 


0.0843 


0.0602 


^"A/~1CI l\/ 


0.0676 


0.0790 


AtVVVit 


0.0696 


0.1 305 


COZHEE 


0.0223 


0.0966 


Vf~\Df~ CC\ 
AUKutU 


0.0362 


0.4482 


f~\ ICVI 11/ 


0.0483 


0.231 8 


AuUvtU 


0.0443 


0.0838 


ECOJEL 


0.0975 


0.1 573 


A V A 7 A A 
AaAZ-AA 


0.0893 


0.1 1 37 


FAGYOC 


0.0695 


0.3482 


r^CL'D A V 

UtKbAA 


0.0378 


0.0768 


f~ A l/VI A 


0.0462 


0.1 312 


UtKbUL 


0.1 1 63 


0.1 557 


GINREA 


0.0460 


0.0775 


EWIPUV 


0.0365 


0.1275 


GODKOZ 


0.0358 


0.0523 


FUXRIZ 


0.0742 


0.0715 


HANCII 


0.0196 


0.0593 


GIFLIQ 


0.1142 


0.1489 


HOESUL02 


0.0621 


0.0916 


MOGWUB 


0.4893 


0.6047 


KITGOJ 


0.0254 


0.0773 


MUHWIW 


0.0491 


0.0888 


LEYJEE 


0.0413 


0.1874 


NUFQAG 


0.0399 


0.0715 


LIZPAL 


0.0499 


0.2272 


NUYNOL 


0.0689 


0.0652 


NIHRIF 


0.0640 


0.1070 


NUYNOLOl 


0.0484 


0.0764 


NUJBAV 


0.0087 


0.0625 


QELLOJ 


0.0493 


0.0937 


QOZVOQ 


0.0185 


0.1002 


QIVYAW 


0.0423 


0.0620 


SIFZIQ 


0.0273 


0.1721 


YEFVUA 


0.0349 


0.0760 



*The complexes are identified by their unique CSD codes [37-39]. 
doi:1 0.1 371 /journal.pone.0086376.t003 



Methods 

The rationale of the RM 1 model for the lanthanides starts with 
the following electron configuration for the lanthanide atoms: 
{[Xe]4r}5d' 6s^ with n = 9, 10, 11 for Dy, Ho and Er, 
respectively. The semiempirical core of the atoms then becomes 
{[Xe]4P}. The semiempirical valence shells wUl now have three 
electrons and will be described by 5 d, 6 s and 6 p orbitals, for a 
total of 9 orbitals. Hence the model will work for trications only, 
because for dications there would be a need to parameterize 
another core of the form {[Xe]4f^^'} and assign two electrons to 
the valence shells, although they could still be described by another 
set of 5 d, 6 s, and 6 p orbitals. Since trications are by far the most 
common form of lanthanide ions, as before, we expect the present 
parameterization to be able to tackle essentially all cases relevant 
to technological applications. 

The next step is to define the universes of complexes, one 
universe for each of the lanthanide ions under consideration. 
Accordingly, we selected from the Cambridge Crystallographic 
Database [37-39] all available complexes of high crystallographic 
quality (R <0.05), for a total of 61 of Dy(III), 40 of Ho(III), and 50 
of Er(III). 

We then proceeded to select sub-sets of complexes, the 
parameterization sets, according to some metric capable of 
guaranteeing that these sub-sets are representative of the universe 
of complexes with respect to some accuracy measure. Assuming 
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Table 4. Unsigned mean errors, UME(Er.L)S and UMEs, for 
Method RMI, as compared to the respective experimental 
crystallographic values, obtained from the Cambridge 
Structural Database, [37-39] for each of the 59 erbium (III) 
complexes. 



Complex* 


Metnod RMI 




Complex* 


Metnoa RMI 






UME|Er-uS 

(A) 


UME 

(A) 




UME,Er-L|S 

(A) 


UME 

(A) 


AERETS02 


0.0563 


0.0895 


RIKTEK 


0.0522 


0.2208 


AKIYEY 


0.0440 


0.2700 


RIKTEKOl 


0.0488 


0.2197 


BAGBER 


0.0462 


0.1076 


ROCSOS 


0.0782 


0.2444 


BEXLEW 


0.0842 


0.1678 


ROCTOT 


0.0595 


0.2138 


BOBWAQ 


0.0421 


0.1 184 


RUNQOG 


0.05 1 3 


0.1022 


BODMUD 


0.0388 


0.1382 


SEGVAB 


0.0691 


0.1681 


BOWXOA 


0.0450 


0.1848 


SOKBID 


0.0280 


0.0513 


DEKCEC 


0.0371 


0.0914 


TACERBOl 


0.0262 


0.0958 


DIBTAJ 


0.0369 


0.1778 


TEJFEU 


0.1355 


0.2048 


DIDCAU 


0.0399 


0.2601 


TEPKOO 


0.0297 


0.0715 


DIJQAO 


0.0391 


0.2332 


TMHDER 


0.0588 


0.3224 


DIJQIW 


0.0310 


0.1024 


TUMJEQ 


0.0583 


0.1227 


DIYNII 


0.0260 


0.1 158 


UFIRIK 


0.0622 


0.0716 


DOGKEP 


0.0323 


0.1502 


VEQFOM 


0.0820 


0.0617 


GAKYOG 


0.0266 


0.1057 


VOSNOG 


0.0375 


0.0833 


GINRIE 


0.0464 


0.0807 


VUSGUL 


0.0817 


0.0941 


HANCOO 


0.0165 


0.0609 


VUSHEW 


0.0593 


0.1 107 


HEDVIW 


0.1052 


0.1989 


WEFVIM 


0.0466 


0.0843 


HENAEB 


0.0283 


0.1548 


XAXYAX 


0.0593 


0.0854 


k'lTfl ID 


0.01 49 


0.0729 


AtVVVUl^ 


0.0442 


0.1 1 63 


KOZBUW 


0.0144 


0.0561 


XEWWUR 


0.0472 


0.0938 


LEYJII 


0.0294 


0.2276 


XOVHAS 


0.0851 


0.2268 


MAGDOP 


0.0449 


0.1319 


XOYXIS 


0.0285 


0.1590 


MECDEE 


0.0281 


0.0853 


YEGFEV 


0.0504 


0.1687 


NIVQUE 


0.0499 


0.2188 


YEMSIT 


0.0767 


0.1338 


NUYNUR 


0.0838 


0.2163 


YICCIW 


0.0344 


0.1527 


OHUZEX 


0.0269 


0.0890 


YUFWIG 


0.0196 


0.0988 


OMATUS 


0.0742 


0.1597 


ZADWUW 


0.1281 


0.2265 


QIVXID 


0.0742 


0.2170 


ZUFSAU 


0.0480 


0.1304 


RELNIG 


0.0530 


0.1011 









*The complexes are identified by their unique CSD codes [37-39]. 
doi:l 0.1 371/journal.pone.0086376.t004 



that any difTiculties Sparkle/AMl might be having in describing 
the coordination polyhedron of the complexes is a reasonable first 
order approximation to the eventual overall difficulty which the 
present model will encounter, we defined the following R; metric 
for each one of the i complexes of the universe for each lanthanide 
trication: 

where j runs over all types of bonds, e.g. Ln-N, Ln-O, Ln-C, Ln-S, 
Ln-P, etc; k, runs over all bonds of type j; cry^'is the standard 
deviation of all crystallographic bond lengths of type j for all 



complexes of the universe; <^,y™ is the crystallographic k' bond 
distance of type j for complex i; dfj'l^'is the calculated value of the 
same bond; a""^''^ is the standard deviation of all crystallographic 
bond angles of the type A-Ln-B, with A,B = O, N, C, S, CI, and 
Br; is the crystallographic 1* bond angle of complex I; and 

off'' is its calculated counterpart. The standard deviations were 
calculated from the experimental data only. We also found out 
that there was no need to split the angles into types, as they all 
formed a homogeneous set. The divisions of the errors by their 
corresponding standard deviations make sure that the summations 
in Eq. (1) add comparable terms. To the set of Ri values, each one 
associated with a different complex, we employed a hierarchical 
clustering analysis DIANA [40] . DIANA starts out with one large 
cluster containing all complexes. In the subsequent steps, the 
complexes that are the most dissimilar are split off into smaller 
clusters - a procedure which continues until each complex forms a 
cluster of itself From the resulting dendogram, we chose two sets 
of complexes as parameterization sets: a smaller and a larger one. 
For Dy(III) these sets contained 1 3 and 26 complexes, respectively. 
The corresponding numbers for Ho(III) were 12 and 20, and for 
Er(III) 16, and 39. 

The parameterization was carried out to minimize the sum of 
R;S for all complexes of parameterization set, with the difference 
that the calculated distances and angles in Eq. (1), are now the 
ones calculated by the model being parameterized. For the 
parameterization, we used a combination of Simplex and 
generalized simulated annealing [41] algorithms. We started with 
the smaller parameterization sets. Once these preliminary 
optimizations converged, we then expanded the parameterization 
sets to the larger ones and repeated the process until termination. 
Table 1 presents the final optimized parameters. 

Results and Discussion 

In order to evaluate the quality of the optimized parameters, we 
devised two measures [23,42]. Both are based on the following 
formula: 

UME.= ^-J2\R^f''-Rfr\ (2) 
'\i=i 

where UME stands for unsigned mean error; i refers to a given 
complex; n is the number of distances taken into consideration in 
the given complex; tiie superscript CSD indicates that the distance 
R is an experimental crystallographic distance taken from CSD, 
and the superscript RMI means that the distance was calculated 
from the present model. In the first measure we consider only 

Table 5. Means and variances for the y distribution fits of the 
UME(Ln-L)S computed for the N complexes for each lanthanide 
trication. 



UME,L„.L|S 



Lanthanide 



on 


N 


mean (A) 


variance (A^) p-value 


Dy'+ 


61 


0.0539 


0.0032 


0.7986 


Ho'+ 


40 


0.0602 


0.0069 


0.1292 


Er^+ 


58 


0.0506 


0.0025 


0.9082 
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Table 6. IVleans and variances for the y distribution fits for the 
UMEs computed for the N complexes for each lanthanide 
trication. 



UMEs 


Lanthanide 
ion 


N 


mean (A) 


variance (A^) 


p-value 




61 


0.1193 


0.0169 


0.1578 




40 


0.1225 


0.0290 


0.1463 




58 


0.1348 


0.0228 


0.5425 



doi:1 0.1 371 /journal.pone.0086376.t006 



distances between the central lanthanide ion and its directly 
coordinating atom distances, which we call UME(Lj,_l) In the 
second measure, which we call simply UME, we consider not only 
the lanthanide ion-directiy coordinating atoms as before, but also 
all distances between all atoms of the coordinating polyhedron, 
thus indirectly taking into account the angles within the 
coordination polyhedron. 



Tables 2-4 present UME(Ln_L)S and UMEs for the universe set 
of complexes for each of the lanthanide trications: Dy(III), Ho(III), 
and Er(III), identified by their respective CSD codes. 

We now proceed to the statistical validation of the model [43] . If 
the parameterizations captured the essence of the coordinating 
bonds, then the histograms of both UME(l„.l) and UME must 
follow gamma distribution functions since, by definition, the 
UMEs can only have positive values. The gamma distributions are 
then adjusted to reproduce the mean and variance of the UME(Ln. 
L)S, for each of the parameterized trications. Finally, the qualities of 
the gamma distribution fits of the data were then assessed via the 
one-sample nonparametric Kolmogorov-Smrrnoff test [44] . If the 
p-value of the test is larger than 0.05, then the gamma distribution 
fit is justified within a 95% interval and the use of the mean and 
variance of the data, as accuracy measures, is also statistically 
justified. Accordingly, Tables 5 and 6 display the mean, variance, 
and p-value of the test for each of the lanthanide ions, for both the 
UME(L„_L)S and UMEs. All p-values are substantially larger than 
0.05 and, therefore, the means and variances in Tables 5 and 6 
can justifiably be taken as accuracy measures of the models for 
Dy(III), Ho(III), and Er(III). 

We now proceed to analyze the performance of the models with 
respect to specific types of distances for Dy(III), Ho(III), and 



0.90 I 1 

0.80 UME (Ln-Ln) 

< 0.70 BRMI Sparl<le/RM1 ■ Sparkle/PIV17 ■ Sparl<le/PIVI6 ■ Sparkle/PIVIS ■Sparkle/AIVI1 

uT 0.30 H H 

^Eil - H ■ J"! ■ lTJ 

Dy Ho Er 

Figure 1. UiVIE(Ln-Ln)S obtained using the RIVI1 model for the lanthanides and all five versions of the Sparkle Model: Sparkle/RM1, 
Sparkle/PiVI7, Sparkle/PM6, Sparkle/PiVI3 and Sparkle/AMI for all complexes of the universe set for each of the lanthanide 
trications: Dy(lll], Ho(lll) and Er(lll). The UIVlEs are calculated as the absolute value of the difference between the experimental and calculated Ln- 
Ln interatomic distances, summed up for all complexes, for each of the lanthanides. 
doi:1 0.1 371 /journal.pone.0086376.g001 
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Figure 2. UME(Ln-o)^ obtained using the RM1 model for the lanthanides and all five versions of the Sparkle Model: Sparkle/RM1, 
Sparkle/PM7, Sparkle/PM6, Sparkle/PM3 and Sparkle/AMI for all complexes of the universe set for each of the lanthanide 
trications: Dy(lll), Ho(lll) and Er(lll). The UMEs are calculated as the absolute value of the difference between the experimental and calculated Ln- 
0 interatomic distances, summed up for all complexes, for each of the lanthanides. 
doi:1 0.1 371/journal.pone.0086376.g002 
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Figure 3. UM E(Ln-N)S obtained using tlie RMI model for the lanthanides and all five versions of the Sparkle Model: Sparkle/RM1, 
Sparkle/PM7, Sparkle/PM6, Sparkle/PM3 and Sparkle/AMI for all complexes of the universe set for each of the lanthanide 
trications: Dy(lll), Ho(lll) and Er(iii). The UiVlEs are calculated as the absolute value of the difference between the experimental and calculated Ln- 
N interatomic distances, summed up for all complexes, for each of the lanthanides. 
doi:1 0.1 371 /journal.pone.0086376.g003 



Table 7. RMI, Sparkle/RMl, Sparkle/PM7, Sparkle/PM6, Sparkle/PM3, and Sparkle/AMI unsigned mean errors for different types of 
distances of dysprosium{lll) complexes. 



Type of distances unsigned mean errors for specific types of distances (A) 





N 


RMI 


Sparkle/RMl 


Sparkle/PM7 


Sparkle/PM6 


Sparkle/PM3 


Sparkle/AMI 


Dy - Dy 


16 


0.2118 


0.2397 


0.5950 


0.5025 


0.1784 


0.2531 


Dy-0 


283 


0.0637 


0.0760 


0.0692 


0.1259 


0.0685 


0.0740 


Dy - N 


105 


0.0594 


0.0510 


0.2084 


0.0992 


0.0741 


0.0500 


Dy-C 


315 


0.0341 


0.1854 


0.2567 


0.4616 


0.2346 


0.2161 


Dy - S 


21 


0.0824 


0.5016 


0.9833 


0.5359 


0.4996 


0.5019 


Dy - P 


3 


0.0273 


0.1591 


2.3194 


0.3934 


0.4205 


0.3918 


Dy - CI 


20 


0.0525 


0.2858 


0.1757 


0.2318 


0.2475 


0.2546 


Dy - Br 


5 


0.0311 


0.4564 


1.3418 


0.3986 


0.4209 


0.4320 


Dy - L 


768 


0.0539 


0.1408 


0.2209 


0.2846 


0.1598 


0.1527 


L-L' 


3499 


0.1336 


0.2370 


0.2931 


0.3449 


0.2433 


0.2621 


Dy-L, Dy-Dy and L-L' 


4267 


0.1193 


0.2197 


0.2801 


0.3341 


0.2283 


0.2424 
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Table 8. RMI, Sparkle/RMl, Sparkle/PM7, Sparkle/PM6, Sparkle/PM3, and Sparkle/AMI unsigned mean errors for specific types of 
distances for holmium(lll) complexes. 



Type of distances unsigned mean errors for specific types of distances (A) 



N RMI Sparkle/RMl Sparkle/PM7 Sparkle/PM6 Sparkle/PM3 Sparkle/AMI 



Ho - Ho 


4 


0.1301 


0.2083 


0.2144 


0.2751 


0.1958 


0.2747 


Ho - 0 


219 


0.0475 


0.0604 


0.0554 


0.1021 


0.0639 


0.0557 


Ho - N 


58 


0.0786 


0.0696 


0.1592 


0.1732 


0.0585 


0.0469 


Ho - C 


98 


0.0752 


0.2256 


0.4544 


0.5380 


0.2537 


0.2655 


Ho - CI 


28 


0.0585 


0.3055 


0.1262 


0.2777 


0.2480 


0.2679 


Ho - L 


407 


0.0602 


0.1198 


0.1727 


0.2310 


0.1228 


0.1217 


L-L' 


1748 


0.1371 


0.2326 


0.3289 


0.3462 


0.2324 


0.2585 


Ho-L, Ho-Ho and L-L 


2155 


0.1225 


0.2113 


0.2994 


0.3245 


0.2117 


0.2327 
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Table 9. RMI, Sparkle/RMI, Sparkle/PMT, Sparkle/PM6, Sparkle/PMB, and Sparkle/AMl unsigned mean errors for specific types of 
distances for erbium complexes. 



Type of distances unsigned mean errors for specific types of distances (A) 





N 


RIVII 


Sparlde/RIVIl 


Sparl<le/PM7 


Sparlde/PIVie 


Sparl<le/PM3 


Sparlde/AIVII 


Er - Er 


6 


0.1200 


0.2639 


0.5458 


0.2124 


0.2439 


0.2626 


Er - 0 


336 


0.0509 


0.0730 


0.0874 


0.1285 


0.0657 


0.0689 


Er - N 


77 


0.0594 


0.0484 


0.0529 


0.0846 


0.0551 


0.0434 


Er - C 


96 


0.0318 


0.2004 


0.4280 


0.5474 


0.2277 


0.2177 


Er - S 


12 


0.1088 


0.4802 


1.2501 


0.5234 


0.521 1 


0.5212 


Er-CI 


33 


0.0574 


0.3243 


0.3406 


0.2753 


0.2975 


0.2928 


Er - Br 


3 


0.0463 


0.4526 


1 .5964 


0.4475 


0.4146 


0.4291 


Er - L 


563 


0.0511 


0.1199 


0.1967 


0.2136 


0.1205 


0.1189 


L - L' 


2259 


0.1575 


0.2197 


0.2938 


0.3519 


0.2076 


0.2371 


Er-L, Er-Er and L-L' 


2822 


0.1363 


0.2000 


0.2746 


0.3243 


0.1904 


0.2137 



doi:1 0.1 371 /journal.pone.0086376.t009 
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Figure 4. UME(i.n-c) obtained using the RM1 model for the lanthanides and all five versions of the Sparkle IVIodel: Sparkle/RMI, 
Sparkle/PIVI7, Sparkle/PM6, Sparkle/PIVI3 and Sparkle/AMl for all complexes of the universe set for each of the lanthanide 
trications: Dy(lll), l-lo(lll) and Er(iii). The UiVlEs are calculated as the absolute value of the difference between the experimental and calculated Ln- 
C interatomic distances, summed up for all complexes, for each of the lanthanides. 
doi:1 0.1 371 /journal.pone.0086376.g004 
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Figure 5. UME(Ln-s) obtained using the RM1 model for the lanthanides and all five versions of the Sparkle Model: Sparkle/RM1, 
Sparkle/PM7, Sparkle/PM6, Sparkle/PM3 and Sparkle/AMl for all complexes of the universe set for each of the lanthanide 
trications: Dy(lll), Ho(lll) and Er(iii). The UMEs are calculated as the absolute value of the difference between the experimental and calculated Ln- 
S interatomic distances, summed up for all complexes, for each of the lanthanides. There are no Ho-S distances in the universe of Ho(lll) complexes 
considered. 
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Figure 6. UME(i.„.ci)S obtained using the RM1 model for the lanthanides and all five versions of the Sparkle Model: Sparkle/RM1, 
Sparkle/PiVI7, Sparkle/PM6, Sparkle/PiVI3 and Sparkle/AMI for all complexes of the universe set for each of the lanthanide 
trications: Dy(lll), Ho(lll) and Er(lll). The UIVlEs are calculated as the absolute value of the difference between the experimental and calculated Ln- 
CI interatomic distances, summed up for all complexes, for each of the lanthanides. 
doi:1 0.1 371/journal.pone.0086376.g006 
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Figure 7. UME{Ln-Br)S obtained using the RMI model for the lanthanides and all five versions of the Sparkle Model: Sparkle/RM1, 
Sparkle/PM7, Sparkle/PM6, Sparkle/PM3 and Sparkle/AMI for all complexes of the universe set for each of the lanthanide 
trications: Dy(lll), Ho(lll) and Er(iii). The UMEs are calculated as the absolute value of the difference between the experimental and calculated Ln- 
Br interatomic distances, summed up for all complexes, for each of the lanthanides. The RMI model for lanthanides UME(Ln-Br) bars are very small. 
Besides, there are no Ho-Br distances in the universe of Ho(lll) complexes considered. 
doi:1 0.1 371 /journal.pone.0086376.g007 



Er(ni). Tables 7-9 and Figures 1-3 show UMEs for all types of 
Ln-L distances present in the universe of Ln(III) complexes, 
together with the corresponding values from the previous sparkle 
models for comparison. It must be noted, though, that in the 
original sparkle model articles, we only included complexes with 
exclusively Ln-O and Ln-N bonds. But, in the present article, we 



are considering a much larger set of complexes with other types of 
bonds such as Ln-C, Ln-CI, etc. Indeed, here we now may have 
complexes which have not only Ln-O and/or Ln-N bonds, but 
also other types of bonds, such as Ln-C bonds, all in the same 
compound. Of course, these were not included as test cases for the 
previous Sparkle models, but are here taken in full consideration. 
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Figure 8. UME(i._i.',s obtained using the RM1 model for the lanthanides and all five versions of the Sparkle Model: Sparkle/RM1, 
Sparkle/PM7, Sparkle/PM6, Sparkle/PM3 and Sparkle/AMI for all complexes of the universe set for each of the lanthanide 
trications: Dy(lll), Ho(lll) and Er(lll). The UMEs are calculated as the absolute value of the difference between the experimental and calculated 
interatomic distances between the coordinated atoms, L-L', summed up for all complexes, for each of the lanthanides. 
doi:1 0.1 371/journal.pone.0086376.g008 
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And that is the reason why UMEs for the Ln-O and Ln-N types of 
bonds in the present article tend to be different, shghdy larger, 
when set side by side to similar Ln-O and Ln-N UMEs of the 
original sparkle model articles. However, not to unnecessarily 
crowd the present article, in the tables, we only show numbers 
computed using the old models, but for the new test set. 

Dy-Dy distances in dUanthanide complexes of Dy (III), Ho (III), 
and Er (III) lie in the range from 3.6 A to 6.6 A, while lanthanide- 
other ligand atom distances lie on average around 2.5 A. That is 
why Ln-Ln UMEs are larger than other Ln-L UMEs. The 
previous Sparkle Models focused on these Ln-Ln, a also on Ln-O, 
and Ln-N distances only. Indeed, considering only Dy complexes 
(Table 7), there are 404 distances of these types, which represent 
53% of all distances involving Dy(III) in its universe of complexes. 
The next most important types are Dy(III)-C distances, for which 
there are 3 1 5 of them making up 4 1 % of the total. 

By examining Table 7 and Figure 4, we can see a significant 
improvement in these nc-xt types of distances, with UME (Dy-C)S for 
the RMl model for the lanthanides being 0.03 A, whereas the 
corresponding average value of the sparkle models is 0.27 A, a 
value 9 times larger. That alone would justify the introduction of 
the RMl model for the lanthanides because, in the case of 
dysprosium, almost half the extant Ln-L distances are significandy 
more accurately described by RMl. 

The situation is less dramatic but stUl significant for the other 
trications being parameterized, when Ln-C distances represent 
24% of the total for Ho(III), and 17% for Er(III). The RMl model 
for the lanthanides is even further justified when we compare other 
types of less common distances like Ln-S, Ln-Cl, and Ln-Br, 
because it outperforms all previous sparkle models as shown in 
Tables 7-9 and Figures 5-7. In all these instances, the RMl 
UMEs tend to be almost ten times smaller than the corresponding 
errors of all previous sparkle models. 

Finally, we can have an idea of the accuracy of the angles by 
examining the distances between any two atoms of the coordina- 
tion polyhedron, the L-L'distances. For all three lanthanide 
trications, there was a significant reduction of these UMEs by a 
factor of two when compared to the corresponding UMEs for the 
previous sparkle models: from 0.26 A to 0.13 A, as can be inferred 
from Figure 8. This is indirect evidence that the angles are much 
better predicted in the RMl model for the lanthanides. 

Conclusions 

The RMl model for the lanthanides represents a significant 
improvement in the theoretical semiempirical modeling of 
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lanthanide complexes, which started with the sparkle model in 
1994 [18,19] which and attained maturity with the introduction of 
Sparkle/ AMI in 2005 [23], and was extended to Sparkle/PM3 
[31,32], Sparkle/PM6 [33], to Sparkle/PM7 [34] and up to 
Sparkle/RMl [35], the last two in 2013. 

There is, however, a cost associated with the improvement. The 
RMl model for the lanthanides adds nine more orbitals per 
lanthanide to the calculation, whereas all sparkle models add none. 
Thus, for a single SCF calculation of a complex of about 60 atoms, 
for example, an RMl model for the lanthanides calculation takes 
about twice the computing time of a Sparkle/RMl calculation. 
Such cost can become even weightier if the molecular structure 
contains more than one lanthanide ion, as is usueiUy the case of 
metal-organic frameworks. Since the performance of both 
Sparkle/RMl and RMl model for the lanthanides is essentially 
erjuivalent for Ln-Ln, Ln-O, and Ln-N, the user may still benefit 
from the speed of the sparkle models if the structure of interest 
contains only these types of bonds, as it takes place in the majority 
of cases. 

In conclusion, via the introduction of a set of 5 d, 6 s, and 6 p 
semiempirical atomic orbitals, the RM 1 model for the lanthanides 
thus extends the Sparkle Models' capabilities of correctiy 
describing Ln-Ln, Ln-O, and Ln-N distances, to other types of 
distances, such as Ln-C, Ln-S, Ln-P, Ln-Cl, and Ln-Br, while 
simultaneously improving the coordinating bond angles. 
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